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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Choo, Yen 

Klug, Aaron 

Sanchez Garcia, Isidro 



(ii) TITLE OF INVENTION: Improvements in or Relating to 
Binding Proteins for Recognition of DNA 



(iii) NUMBER OF SEQUENCES: 18 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pillsbury Madison & Sutro, L.L.P. 

(B) STREET: 1100 New York Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20005-3918 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Word Perfect 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 0 8 7 93 4 0 8 

(B) FILING DATE: 1997-06-02 
(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/GB9 5 / 0 1 94 9 

(B) FILING DATE: 17-AUG-1995 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9514698.1 

(B) FILING DATE: 18-JUL-1995 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9422534.9 

(B) FILING DATE: 08-NOV-1994 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9416880.4 

(B) FILING DATE: 20-AUG-1994 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTCCTGCAGT TGGACCTGTG CCATGGCCGG CTGGGCCGCA TAGAATGGAA 5 0 

CAACTAAAGC 6 0 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Glu Glu Arg Pro Tyr Ala Cys Pro 

5 10 

Val Glu Ser Cys Asp Arg Arg Phe Ser Arg 

15 20 

Ser Asp Glu Leu Thr Arg His lie Arg lie 

25 30 

His Thr Gly Gin Lys Pro Phe Gin Cys Arg 

35 40 

lie Cys Met Arg Asn Phe Ser Xaa Xaa Xaa 

45 50 

Xaa Leu Xaa Xaa His Xaa Arg Thr His Thr 

55 60 

Gly Glu Lys Pro Phe Ala Cys Asp lie Cys 

65 70 

Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg 

75 80 

Lys Arg His Thr Lys lie His Leu Arg Gin 

85 90 

Lys Asp 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TATGACTTGG ATGGGAGACC GCCTGG 2 6 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATTCCAGGC GGTCTCCCAT CCAAGTCA 2 8 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TATATAGCGT GGGCGTATAT A 21 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCGTATATAC GCCCACGCTA TATA 2 4 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TATATAGCGN NNGC GT AT AT A 21 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCGTATATAC GCNNNCGCTA TATA 2 4 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTCCATGGAG ACGCAGAAGC CCTTCAGCGG CCA 3 3 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTCCATGGAG ACGCAGGTGA GTTCCTCACG CCA 3 3 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCCCTTTCTC TTCCAGAAGC CCTTCAGCGG CCA 33 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

5 10 

lie Cys Met Arg Asn Phe Ser Asp Arg Ser 

15 20 

Ser Leu Thr Arg His Thr Arg His Thr Gly 

25 30 

Glu Lys Pro 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

5 10 

lie Cys Met Arg Asn Phe Ser Glu Arg Gly 

15 20 

Thr Leu Ala Arg His Glu Lys His Thr Gly 

25 30 

Glu Lys Pro 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Phe Gin Cys Arg lie Cys Met Arg Asn Phe 

5 10 

Ser Gin Gly Gly Asn Leu Val Arg His Leu 

15 20 



Arg His Thr Gly Glu Lys Pro 

25 



INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Phe Gin Cys Arg lie Cys Met Arg Asn Phe 

5 10 

Ser Gin Ala Gin Thr Leu Gin Arg His Leu 

15 20 

Lys His Thr Gly Glu Lys 

25 

INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

Phe Gin Cys Arg lie Cys Met Arg Asn Phe 

5 10 

Ser Gin Ala Ala Thr Leu Gin Arg His Leu 

15 20 

Lys His Thr Gly Glu Lys 

25 

INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 



Phe Gin Cys Arg lie Cys Met Arg Asn Phe 



10 



Ser Gin Ala Gin Asp Leu Gin Arg His Leu 

15 20 

Lys His Thr Gly Glu Lys 

25 

INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

5 10 

lie Cys Met Arg Asn Phe Ser Asp Arg Ser 

15 20 

Ser Leu Thr Arg His Thr Arg Thr His Thr 

25 30 

Gly Glu Lys Pro Phe Gin Cys Arg lie Cys 

35 40 

Met Arg Asn Phe Ser Asp Arg Ser His Leu 

45 50 

Thr Arg His Thr Arg Thr His Thr Gly Glu 

55 60 

Lys Pro Phe Gin Cys Arg lie Cys Met Arg 

65 70 

Asn Phe Ser Asp Arg Ser Asn Leu Thr Arg 

75 80 

His Thr Arg Thr His Thr Gly Glu Lys 

85 



